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Abstract 

Consider the interchange process on a connected graph G = {V, E) on 
n vertices. I.e. shuffle a deck of cards by first placing one card at each 
vertex of G in a fixed order and then at each tick of the clock, picking an 
edge uniformly at random and switching the two cards at the end vertices 
of the edge with probability 1/2. Well known special cases are the random 
transpositions shuffle, where G is the complete graph, and the transposing 
neighbors shuffle, where G is the n-path. Other cases that have been studied 
are the d-dimensional grid, the hypercube, lollipop graphs and Erdos-Renyi 
random graphs above the threshold for connectedness. 

In this paper the problem is studied for general G. Special attention is 
focused on trees, random trees and the giant component of critical and su- 
percritical G{N,p) random graphs. Upper and lower bounds on the mixing 
time are given. In many of the cases, we establish the exact order of the mix- 
ing time. We also mention the cases when G is the hypercube and when G 
is a bounded-degree expander, giving upper and lower bounds on the mixing 
time. 
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1 Introduction 

Card shuffling has, for several decades now, been one of the major playgrounds for 
developing methods for the study of mixing rates for Markov chains. One of the 
first card shuffling techniques to be studied was the random transpositions shuffle, 
where at each (discrete) time, two positions in the deck are chosen uniformly 
at random and the cards at those positions are switched with probability 1/2. 
(Strictly speaking, this is the lazy version of the random transpositions shuffle.) If 
the deck consists of n cards and starts out from a fixed permutation, then an easy 
coupon collector's argument shows that the mixing time in total variation is at least 
n log n. Diaconis and Shahshahani dH proved that this is also sufficient, not only 
in total variation, but also in L^. Matthews [|24ll later proved, via a strong uniform 
time argument, that this is also sufficient in separation. Hence the total variation 
norm exhibits a sharp threshold at time n log n and hence Tmix = ( 1 + o ( 1 ) ) n log n. 

Another shuffle that was studied in the early days, is the transposing neighbors 
shuffle, where at each time a position i E [n — l]m the deck is chosen uniformly 
and the card in that position is switched with the card in position i + 1 with proba- 
bility 1/2. Aldous [IJ showed that the mixing time is at most of order log n and 
at least of order n^. Much later Wilson JSOl] introduced an important technique for 
lower bounding mixing time. This allowed him, among other things, to establish 
that the true order of mixing for transposing neighbors is log n. Wilson also 
considered neighbor transpositions on a grid and on the hypercube, proving that 
the order of mixing on the ^/n x y/n grid is log n and that the order of the 
mixing time on Z2 is at least n log^ n, where n = 2'^. Wilson conjectured that 
n log^ n is the correct order of mixing on the hypercube. We partially confirm this 
by showing that mixing time is 0{n n). 

Remark on notation. Consider two expressions f{n) and g{n) as n — )■ 00. We 
will adopt the notation f{n) = o{g{n)) when f{n)/g{n) — )■ 0. Writing f{n) = 
0{g{n)) means that that for some constant C < 00, f{n) < Cg{n) for all n. 
We write /(n) = VL{g{n)) for g{n) = 0{f{n)) and f{n) = uj{g{n)) for g(n) = 
o{f{n)). We will also use the abbreviation "whp" for "with high probability". I.e. 
as a parameter, which is understood from the context, tends to 00, an event that 
occurs with probability 1 — o(l) is said to occur whp. 

In the light of the above, it appears natural to consider neighbor transposition 
shuffling on a general connected n-vertex graph G = {V, E), i.e. the card shuffle 
that at each time chooses uniformly at random an edge e E E and with probabil- 
ity 1/2 switches the two cards at the end vertices of e. Indeed, this general setup 
was considered by Aldous and Fill [|2j|. Chapter 14, but their analysis did not go 
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further than what we have already mentioned, in terms of analyzing the process 
on particular G"s with respect to mixing time. Aldous and Fill introduced the 
name interchange process for neighbor transposition shuffling on general graphs. 
The interchange process has attracted a fair amount of interest in recent years. 
The most prominent result is perhaps the proof by Caputo, Liggett and Richtham- 
mer [7] of the Aldous spectral gap conjecture (see e.g. David Aldous homepage), 
which states in our setting, that the spectral gap of the interchange process on G 
equals the spectral gap of simple random walk on G times n for all G. It is not 
clear if this has any direct relation to the mixing time results in this paper. As will 
be seen in our main result. Theorem 11.41 below, the relation between the mixing 
times of the interchange process and of random walk is usually not the same as 
for the spectral gap. Indeed, much of our work will go into proving that in many 
cases mixing time for the interchange process exceeds the mixing time (and the 
spectral gap) of simple random walk by a factor n log n. More on this will follow 
in the remark after Theorem 1 1.4[ 

A recent contribution was given by Erikshed [[T4|. He considered the inter- 
change process on lollipop graphs with 6(n) vertices in the clique as well as in 
the handle and proved that Tmix = Q{n^ \ogn). The following result establishes 
that this is the highest possible order of mixing for the interchange process on any 
graph. 

Proposition 1.1 Consider the interchange process on G = {V, E). Let m = \E\ 

and p be the radius ofG. Then 

Tmix < (1 + o(l))8mpnlogn. 

Erikshed also considered the interchange process on the Erdos-Renyi G{n,p) 
random graph for p > \ogn/n, the threshold for connectedness. He proved that 
whp Tmix = ^{n log n) and when np = fi(n^), 5 > 0, then this is the true order of 
mixing. 

In this paper, we consider shuffling on many other graphs. The following 
general result will be our main tool for lower bounding mixing time. 

Proposition 1.2 Assume that 1 — 7, 7 = o{l/n), is the second largest eigenvalue 
for the motion of a single card under the interchange process on G. Let ^ : — )■ M 
be a corresponding eigenvector with \\i\\2 = 1 and assume that > n'^ for 
some a > 0. Then 

Tmix = ^{1'^ \ogn). 
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The proof builds on Wilson's technique, introduced in [|30ll (and developed fur- 
ther in Bn . [|2T]| and [|22||). As in Wilson's technique, we use the eigenvector to 
construct a test function, but since we will not know the precise form of the eigen- 
vector, we bound the variance of the test function in a different way. The condi- 
tion that the L^-norm of the second eigenvector is not too low is a mild restriction. 
An eigenvector corresponding to the second largest eigenvalue of the single-card 
chain, is also known as a Fiedler vector of the graph Laplacian, L(G) (in honor of 
the pioneering work of Fiedler, [15] and [16J) among graph theorists. The Fiedler 
vector has turned out to be of significance in graph theory, most prominently in 
graph partitioning. Intuitively, the L^-norm of a Fiedler vector should be of order 
y/n and I conjecture that it is at least of order for all graphs. However, even 
though many facts about the structure of Fiedler vectors are known, see e.g. O, I 
have not been able to find a result of this kind. Nevertheless, it is easy to establish 
the required lower bound on the L^-norm in most of our applications; 

Lemma 1.3 Let G = (V, E) be an n-vertex connected graph. Suppose that ^ : 
[n] — )• M is an eigenvector of L{G) corresponding to an eigenvalue k = 0{n~^) 
for some b > (i.e. is an eigenvector for the random walk of a single card under 
the interchange process on G, corresponding to the eigenvalue 1 — K/2m) and 
such that \\^\\2 = 1- Then = Q{n'^) for some a > 0. 

Our main result is the following, giving (bounds for) the mixing time on sev- 
eral different graphs. Here T^^ix denotes the mixing time in terms of total variation 
and f is the convergence time in L^. The precise definitions will appear in Section 
2. 

Theorem 1.4 Consider the interchange process on G = (V, E). Set n := \V\. 

(a) Let d = 9(logn) and let G be the first d generations of a rooted r -regular 
tree (i.e. when V consists of all vertices within distance d of a distinguished 
vertex of an r-regular tree), 3 < r = 0(1). Then 

il{n^ \ogn) = Tinix < = 0{n^ log^ n). 

This also goes whp when G is the first d generations of a supercritical 
Galton-Watson tree, with finite variance offspring distribution, conditioned 
to survive. 

(b) Let G be a uniform random (labelled) tree on n vertices. Then whp Tmix 
and f are both Qin^^"^ logn). This also goes whp for the d = Q{n^/'^) first 
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generations of the incipient infinite tree of a critical Galton-Watson process 
whose offspring distribution has finite variance. 

(c) Let G be the giant component of a G{N, (1 + e)/N) graph, where N = 
6(n^/^) and e = 0(A^~^/^). Then whp Tmix and f are both 9(n^/^ logn). 

(d) Let G be the giant component of a G{N, (1 + e)/A^) graph where N = e~^n 
and u{N^^/^) = e = o(l). Then whp 

T-mix = e(ne"^log^(e^n)logn). 

The same thing holds for f. 

(e) Let G be the giant component of a G{N, (l + e) /N) graph where N = <d{n) 
and e = 0(1), Then whp Xmix and f are both Q{n log^ n). 

(f) When G is a bounded degree expander, then Q{nlogn) = Tmix < 't = 
0(nlog^ n). 

(g) When G = Z2, d = log2 n, then log^ n) = Tmix < = 0{n \og^ n). 

Remark. In (b)-(e), the mixing time of the interchange process on G is of the 
order n log n times the mixing time of random walk on G, (see JH, [[TT]|. [[T2l. ifTTll 
and [26] for the mixing time of random walk on these graphs). I believe this to be 
true for (a) as well. On the other hand, in (f) the true mixing time appears to be 
n log n which is Q{n) times the mixing time for random walk. In (g), the mixing 
time for random walk is d log d, so the ratio between the two mixing times is 
between 6 (n log n / log log n) and 6 {n log^ n / log log ra) . It would be interesting 
to know what possible relationships one can have between the two mixing times. 
It would also be interesting to compare mixing times for the interchange process 
with mxing times for the symmetric exclusion process on the same graphs, for 
which new results can be found in [|27ll . 

The rest of the paper is organized as follows: the basic concepts are introduced 
in Section 2 and the proofs are given in Section 3. 



Mixing times for the interchange process 



6 



2 Preliminaries 

Let S he a finite set and u a signed measure on S with 1/(3) = and let vr be a 
probability measure on S. For p e [1, oo), the L^-norm of u with respect to vr as 



\^\\p 



The total variation norm of u is given by 



I ttH) 



2 AC5 

ies 

By Cauchy-Schwarz, 2||z/||Ty = < ll^lh- 

When {Xt} is an irreducible aperiodic Markov chain on S, with stationary 
distribution tt, we define the mixing time as 

Tmi. = min |t : \\¥{Xt E ■) - n\\Tv < \}- 
The convergence time in is given by 



f = min|t:||P(XtG-)-7r||2<^}. 



It follows that Tmix < T. 

All upper bounds in this paper rely on the comparison technique of Diaconis 
and Saloff-Coste [9|. Let {Xt} and {Yt} be two random walks on Sn generated 
by the symmetric probability measures fi and /io respectively. Let E and Eq be 
the supports of fx and /xq respectively. For each y E Eq, choose a representation 
y = X1X2 . ■ ■ Xk, Xi E E and k odd, and write \y\ = k. For each x E E and y E Eq, 
let N(x, y) be the number of times that x appears in the chosen representation of 
y. Define 

= max—- V \y\N{x,y)fio{y)- 

x€E n(x) ^ 

Then the following is a special case of Lemma 5 of ||9l 

Lemma 2.1 The distance in L'^ from stationarity of {Xt} and {Yt} respectively 
relate in the following way. 



2- 
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The most important consequence of the lemma is that, provided that tq = Q{n log n), 

f = O(Afo), 

where f and tq are the convergence times in of the two chains. Of course, 
one also needs a chain {Yt} to compare with for which tq is known. When it 
comes to card shuffling, the by far most common chain to compare with is the 
random transpositions shuffle. The following sharp result is due to Diaconis and 
Shahshahani [10]. 

Lemma 2.2 Let {Yt} be the random transposition shuffle. There exists a constant 
C < oo such that for t = \n{logn + c)J, 

myte-)-n\\i<ce-"^. 

Before starting with the proofs, recall the extremal characterization of eigen- 
values. For our purposes it suffices to recall the special case when A is the sym- 
metric transition matrix of an irreducible aperiodic Markov chain on the finite 
state space S. Then with K2 denoting the second largest eigenvalue of A, we have 
for the spectral gap 7 := 1 — ^2 

7 = min {eil - A)^ : ^e(^) = 0, Uh = l} 

and a vector ^ for which the minimum is attained is eigenvector for K2. In this 
paper this will be applied on the case where A is the transition matrix for the 
motion of a single card under neighbor transpositions on G = (V, E). The neigh- 
bor transpositions shuffle on G is the random walk on the symmetric group Sn 
of permutations of n cards, whose state changes from time t to time t + 1 in the 
following way. An edge e E E is chosen uniformly at random, independently of 
choices made at other times. Then, by another independent choice, the state at 
time t + 1 is the composition of the state at time t with (i j), where i and j are the 
cards at the end vertices of e at time t, with probability 1/2 and identical with the 
state at time t with probability 1/2. This entails that any given single card makes 
a random walk on V with the behavior that given that the card is at vertex u at 
time t, it will at time t + 1 be at m with probability 1 — du/ (2|-E'|) and at v with 
probability 1/(2|£'|) for each neighbor v of u. Hence, when A is the transition 
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matrix for the single card random walk, we have 

{u,v}eE 

where is the degree of vertex v and u v denotes that {u,v} G E. 

3 Proofs 

Proof of Proposition 11.11 This is a straightforward application of Lemma 12.11 
and Lemma |Z2l Let {Xt} be the interchange process on G and let {Yt} be the 
random transpositions shuffle. Let y = (u v) E Eq be an arbitrary transposition. 
Let wqWi . . .Wrhea shortest path between u and v (where wo = v and Wr = v). 
Then 

(U V) = {Wq Wi){Wi W2)... {Wr-2 Wr-l)(w^r-l ^i'r■)(w^^-2 ^r-l) ■ ■ ■ {Wq Wi). 

Hence \y\ < 2r — 1 < 2p and N(x, y) < 2. Since ii{x) = 1/ (2m) for any x E E, 
X ^ id. Lemma [XTI entails that r < (1 + o(l))8mpfo), which by Lemma [Z2l is 
(1 + o(l))8mpn logn). □ 

Proof of Proposition ll.2i Let Xq be a fixed starting state and let ^ : — > R be 

an eigenvector corresponding to the eigenvalue 1 — 7 with J^v^i'^Y ~ ^- "^^^^ 
means that if Xt is the state of the deck and XI is the position of card i at time t, 
then 

E[e(X,Vi)|X,] = (l-7)era. 

Let (j){Xt) = J2i^i^t) where the sum is taken over the i's for which ^(Xg) > 
0. Then is an eigenvector for (the transition matrix of) {Xt}. By assumption 
4>{^o) ^ IT-""- We claim that the terms are negatively correlated. Indeed, writing 

A, = eM-e(X,^-i),wehave 

E[A,A,|Xt_i] < 

since either A,; or A^ is unless card i and card j are adjacent in Xt and the edge 
between them gets chosen for the update, in which case AjAj < 0. Note also that 
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E[Ai\Xt-i] = -7e(^ti)- Hence 

By induction, 
Similarly 

naximmi)] = ii-imxmx^o) > {i-2^yaxmx'o)- 

Hence Cov{^{Xi),^{Xl)) < 0. Using this we have that 

Var(0(X,))<^E[eTO2]<l. 

i 

Since, by assumption, E[(/)(Xt)] = (1 — 7)*0(Xo) = for t := 67^^ logn, 

< b < a, Chebyshev's inequality entails that 

P(0(X,) < ^n^-') = o(l). 

On the other hand, if Xoo denotes a deck at uniformity, we have Var(0(Xoo)) < 1 
and E[0(Xoo)] = and hence 

P(0(Xoo) > \n^~') = oil). 

□ 

Proof of Lemma [L3l Assume for contradiction that = 0{n"-) for all a > 0. 
Then max^, ^{v) = ^l(n^"-) for all a > (assuming that we have chosen ^ so that 
max^, ^(1;) > — min^, ^(t>)). By assumption and the extremal characterization of 
eigenvalues 

so in particular max„^„ — i{v)\ = 0{n^^/'^). Then, if vq is chosen so 
that .^(^o) = ^{n~^f^), all vertices v within distance rr'^/^ /2 of Vq have ^(f) > 
^-6/8^2. If the diameter of G is less than n^^^^ /2, this implies that ^{v) > for 
all ^, a contradiction since J2v '^(^) i^iust be 0. If the diameter is at least 
then we have ||^|| 1 > rr'^/^/2 ■ n-^^^ /2 = n^/'^/A, a contradiction. □ 
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Proof of Theorem ll.4l fa). The upper bounds follow immediately from Proposi- 
tion O 

For the lower bounds, consider first the binary tree. Define r] : V ^ Why 
r]{v) = when v is the root, r]{v) = 1 — 2^'' when v is at distance k from the 
root, in the left third of the tree, r]{v) = — (1 — 2^'^) for vertices at distance k from 
the root, in the right third of the tree and r/ = on the middle third. Expressed 
differently, r](v) is the electric potential at v when we apply the potential +1 to 
the leafs of the left third of the tree and —1 to the vertices of the right third and 
regard all edges as unit resistors. 

Then = and — Also, since r] is harmonic off the 

leafs, dyii]{v) — Ylur^v vi'^) = when v is not a leaf. For leafs v, we have 

ri{v) (d^ri{v) - J] ri{u)^ ^ ^ ^ ^' 

Hence 

7 < = e(l/n% 

■m<c)[n) 

Plugging this into Proposition 11.21 and using Lemma [L3] gives 

T"mix = ^{n^ logn) 

as desired. 

We omit the analogous proof for r > 3, so let now G be a supercritical Galton- 
Watson tree conditioned on survival, where /i'^ = 9(n) and /i > 1 is the expecta- 
tion of the offspring distribution. Let m be a vertex with at least two children whose 
progeny survives, in the first generation where such a vertex exists. Let ui and U2 
be two arbitrarily chosen such children of u and let Ai and A2 be the sets leafs in 
the progeny of ui and U2 respectively. Let r](v), v E V,he the electric potential in 
V, with boundary condition r] = +1 on Ai and r] = —1 on A2. Using well-known 
facts about Galton-Watson processes, we have whp that, |v4i| = 6(/i'^) = ©(n), 
1^2! = 0(^) and that the effective resistance between Ai and is 6(1). Hence 
the total current flowing through edges incident to vertices of Ai and A2, is 0(1), 
i.e. the sum of the absolute potential differences over these edges is 6(1). Since 
T] is harmonic off Ai U A2, we get 
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We cannot, however, plug this into Proposition 11.21 since typically J2vV{^) is 
not 0. However replacing i] with </> := rj — fj, does not change the expression. 
Also, since \Ai\ = Q{n), it is clear that J^v'f^i^y ~ above, this gives 

7 = 0(l/n^) and hence, by Proposition 11.21 and Lemma [L3l 

T-mix = ^{n^ \ogn). 

□ 

Proof of Theorem |1.4| fb). Let first G = {V, E) be the first d generations of the 
incipient infinite tree of a critical Gallon- Watson process. It is well-known that 
whp, G contains Q{<P) vertices, i.e. d = 6(n^/^). 

The upper bound on Tmix now follows from Proposition ! 1.1[ 
Let A be the set of leafs at distance d from the root. By HTSl . Proposition 
1.1 and [8], Lemma 3.1, it follows that whp the effective resistance between the 
root and A is 0(n^/^). Let n be a vertex closest to the root for which at least two 
children are connected to A. Pick arbitrarily two such children, ui and U2 and 
let Ai be the set of vertices of A which are in the progeny oi Ui, i = 1,2. Whp 
\Ai\ = Q{d), see e.g. [8], Proposition 2.6, (since the finite variance condition on 
the offspring distribution makes sure that the survival probability up to generation 
d of the unconditioned Gallon- Watson process is Q(l/d)), and the effective re- 
sistance between Ai and A2 is at least Q{d). Apply the potential +1 to and 
— 1 to A2 and let ri{v) be the potential atv,v G V. Letting = r/ — rj, we get 
E„0(t^)' = eHand 

Y,{dMv)-J2^{u))=e{l/n'/') 

since the total current flowing into A^ is 6(n^^/^). Hence 7 = 6(l/n^/^), so by 
Proposition 1 1.2[ 

^mix = ^{n^'"^ \ogn). 

Now we turn to the random tree, G, on n vertices. It is well-known that the 
diameter of G is whp 0(n^/^), so the upper bound again follows from Proposition 

o 

By fS\, Proposition 1, the law of G is identical to the law of the Gallon- Watson 
tree with Poisson(l) offspring distribution, conditioned to contain exactly n ver- 
tices. Let d = Grt^l'^ . Whp G contains 0(n^/^) vertices at distance d from the 
root for small enough G. Fix u, ui and U2 as above, such that m is a vertex clos- 
est to the root for which at least two children are connected to level d. Continue 
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to mimic the above argument, letting Ai be the set of vertices at level d in the 
progeny of Ui. Define r] and (p as above. Then 0(f )^ = Q{n) whp and by 
ll23l . G is stochastically dominated by the incipient infinite cluster of the same 
Galton-Watson process. (Indeed, it is shown in |!23l that the trees conditioned on 
having exactly n vertices, = 1, 2, . . ., is stochastically increasing in n. This is 
shown for binomial(A;, l/k) offspring distribution for any k, so the Poisson(l) off- 
spring distribution follows as a limiting case.) Hence it follows that the effective 
resistance between Ai and A2 is whp 6(?2^/^). The proof is now completed by an 
appeal to Proposition 1 1.1[ □ 

Proof of Theorem 11.41 (c). Let G = (V, E) be the giant component of the Erdos- 
Renyi random graph G{N, (1 + e)/N), e = 0{l/N^/^). It is a classical result 
that whp \ V\ = Q(N'^/^). It is also known that G is a tree, uniformly distributed 
among all trees on V, with a probability bounded away from and 1. It is easy 
to see that if G is not a tree, then it contains whp only 0(1) cycles. (For detailed 
results on the giant component and random walks on the giant component for 
these parameters, see [|26ll .) Hence the results follow from the proof of (b) with 
only very minor modifications. □ 

Before the proof of (d) and (e), we introduce a random graph model due to 
Ding et el. (for e = o{l/N^/'^)) and UJI (extended version for e = o(l)) 
and further explored in [11 J. The model is contiguous to the giant component 
of the G{N, (1 + e)/N) model, uj{l/N^^^) = e = o(l), in the sense that any 
whp property for one is also whp for the other. The model is given by a three- 
step procedure as follows. Here /i is the conjugate of 1 + e, i.e. fi < 1 and 
He-^' = (l + e)e~(i+^). 

( 1 ) Let r be a normal random variable with mean 1 + e — ji and variance 1/ (en) . 
Let Du, u E [N], be iid Poisson(r) random variables, conditioned that 
X]m d„>3 even. Let M^, k = 1, 2, . . ., be the number of u such that 
Du = k and M = J2k>3 Select a random multigraph, H, on M vertices 
uniformly among all multigraphs with vertices of degree k, k > 3. 

(2) Let K be the multigraph obtained from H by replacing each edge e G E{H) 
with a path of independent geometric(l — /i) length. 

(3) Attach to each vertex v E V{K) an independent Galton-Watson tree with 
Poisson(/i) offspring distribution (with v as its root). 



Mixing times for the interchange process 



13 



Note that the variance of the normal distribution in (1) is of smaller order than 
the square of the mean, so whp T = 6(e). This entails that H contains whp 
9(e^A^) vertices and edges and hence K has whp e^A^ vertices. 

Proof of Theorem 1 1.4l fd). This proof is fairly long, so we split into four parts: 
lower bound (general), lower bound for of polynomial order, lower bound for 
of subpolynomial order and upper bound. 

Lower bound. Write F for the graph constructed from (l)-(3) above. We start 
with the lower bound. Write n = 1 — 6 and note that 5 = (1 + o(l))e. Let also 



where a is a constant independent of and e. Consider a Poisson(yu) tree. Let Zr 
be the size of the r'th generation and T the total size of such a tree. Condition now 
on that such a tree survives for at least k generations, by conditioning on the back- 
bone of the tree, i.e. the leftmost path of length k, P = VqVi . . .v^ from the root 
to generation k. (To properly make sense of the word "leftmost", associate with 
each vertex, v, of a Poisson(yu) tree, an independent Poisson process of intensity fx 
on the unit interval and let v have one child per point of this process. Then order 
the children from left to right according to the positions of their respective Pois- 
son points.) Then from each Vi on the backbone, an independent tree emanates, 
having Poisson(/i) offspring in all but the first generation, for which the offspring 
is Poisson(/i(l — Xj)), where Xi is the position in the unit interval of the Poisson 
point associated with f j. Let Zr^ be the size of generation r and T*-*^ the total size 
of the tree emanating from Vi, including Vi so thatT = Er=o^^'^- Then 



Since P is whp such that is less than, say, 2/3 for more than half the i's, we 



fco = \og{e^N/a) 



E[Z«|P] = (l-X,)/i^ 



and 




r=0 



get whp 



k 




(1) 



and henceE[T|P] 



> 0) 



e{k/6). By© 

nzk] 



0((i 



) 



E[Zk\Zk > 0] 



6(1/5) 
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which is at least C/ (e^N) for any large C if k — ko and the constant a in the 
definition of ko is small enough. Since there are whp Q{e'^N) vertices in K, at 
least two of the trees, attached to K in step (3) above, will whp survive for ko 
generations. 

Also, by the standard formula for the variance of the generation sizes of a 
Galton- Watson tree (modified for the first generation), whp, 

Var(Z«|P) < ^. 



Using that 

Cov(Z«,Z«|P) < (Var(Z«|P)Var(Z«|P))i/2 < ^ ^ ^ 

we get 

Var(r»|P) < 4 

so that 

Var(T|P) < ^. 

Hence by Chebyshev's inequality, we have whp 

T^e{ko/S)^e{e-nog{e'N)). 

Now pick uniformly at random, two attached trees that survive up to genera- 
tion ko and fix in each of them a leaf at distance at least ko from the root of it's 
tree. Apply the potential +1 to one of these leafs and —1 to the other. Let rj{v), 
V e V{F), be the resulting potential at v and, again, (f) = rj —fj. By the above 
considerations, = ^{e~'^ log{e^N)). We also get that 

Plugging into the extremal characterization of the second eigenvalue gives, for the 
random walk of a card, 

since whp n — eN. 
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Lower bound for e of polynomial order. Since we here assume that e~ = 
n{n°') for some a > 0, we appeal to Lemma [L3l By Proposition 11.21 it then 
follows that 



establishing the desired result. 

Lower bound for of subpolynomial order. Here more work is needed, since 
unfortunately, I have not been able to establish the required condition on the L^- 
norm of the second eigenvector in Proposition 1 1.2[ Nevertheless, a lower bound 
of Q{ne~^ log^(e^n) logn) = Q{ne~^ log^ n) can be established, using other ar- 
guments. Note that by the above construction, F contains 6(ne) = n(n''), b < 1, 
trees attached in (3). By the above arguments, whp, at least, say n^/^ of these trees 
will survive up to generation ae^^ logn and contain at least ae~^ \ogn vertices, 
provided that a is taken sufficiently small. For each such tree, Tj, i = 1, . . . , n^/^, 
pick a vertex h = hi half way (or half way plus 1/2) between the root, r = r^, 
of the tree and a leaf, / = U, most distant from r. Consider the random walk per- 
formed by the card, c = Cj, that starts in h. For vertices u E Ti, let r„ be the first 
time that c visits u. By the correspondence between random walks and electrical 
networks, F(ti < r,.) = 1/2. Hence, the probability that c walks to / and back to 
h at least k times before it visits r, is 2^'"' > 2n^^^^ for k = (1/3) logn. Write 
A = Ai for this event. Note also for later, that the probability that c visits / more 
than 2 logn times before it visits r is o(l/n), so whp this will not happen for any 
card in the deck. 

Conditioned on A, each of the k return trips to / takes an independent time, 
which is the sum of the time taken to reach / and the time to go from / back to h. 
The conditional expectation of the first of these terms is affected by the fact that 
we condition on hitting / before r. The distribution of the second term however, 
is unaffected by conditioning on A, since a random walk from / must necessarily 
visit h before it visits r. Hence the expected time of a return trip is bounded from 
below by E;r/i (in standard notation). By the hitting time formula from [fT9l , the 
tree version of Tetali's [|29l general hitting time formula, an exact expression for 
EiTh is 



Here d{u, v) is the distance between u and v, P is the set of vertices of the unique 
path between / and h and Gx is component containing x of the graph obtained from 
F by removing the neighbors in P of x. From the above arguments, it follows that 
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whp a non- vanishing fraction of the vertices in a; e P have \E{Gx)\ > Pe ^ (for 
sufficiently small /3) and are at distance > {a/3)e~^ log n from h. Hence, whp, 

¥.iTh = e{me-Hog''n). 

This entails that E;[Tr | A] — Q{ne~^ log^ n) (since m — Q{n) whp). We also have 

YsiTh{Tr\Ai) = ilog„Var(C(/i,/)|r, < r,)) + Var^,(T,) 

where C{u, v) is the commute time between u and v, i.e. the time taken for a 
return trip from utov and back. Let h* be the vertex which maximizes 'EC{x, I) 
over X E Ti. Here it is important to point out that here and in what follows until 
the order of Var/j(rr|A) is established, we consider the random walk restricted 
to Tj. Let h — 'EC{h*, l)/E,C{h, I) which is of constant order, since the effective 
resistance, i.e. distance, between h and I is of maximal order in T,. We get 

F{C{h,l) > kbEC{h,l)\Ti < Tr) < 2F{C{h,l) > kEC{h*,l)) <2-2-^l'^ 

since each round of 2EC(/i*, V) has, by Markov's inequality, a probability of at 
least 1/2 to finish the return trip. Hence 

Var^C(/i,/)|ri < r,) < E^{C{v,,lf\Ti < r,] = O {nh-'log'' n) . 

In the same way, this also goes for the term Ya,r ^{Tr), so summing up gives 

Var^,(r,|A) = 0(nV^log^n). 

By the above consideration, whp, Ai will occur for at least n^/^ of the T/s. Let S 
be the sum of the . 's from the trees Tj for which occurs. Then (whp) 

E,5 = e(n^/V^log^n) 

and, since the Aj's are negatively correlated (dependent only in that when a card 
moves in one attached tree, no card in another attached tree can move at the same 
time; in continuous time we would have had full independence) 

Var(5) = Oin'^^h-^log^n). 

Thus S > BQn^/^e~^ log^ n whp, by Chebyshev's inequality, for a suitable con- 
stant Bq. However, each term r^^ of S is, whp as noted above, the sum of at most 
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21og?T, independent C{h,l) conditioned on ti < Tr (plus an extra unconditioned 
Tr- at the end). Each such C(/i, /) is by the above arguments also bounded in distri- 
bution from above by ¥,C{h*, I) times an independent geometric (1/2) number. By 
the Chernoff bound, this means that P(rr^ < 6\ognEC{h*, /)) = !- o{l/n), i.e. 
whp no exceeds 6 \ognEC{h* , I) < Bine~^ log^ n for a constant Bi. Hence at 
least {BqI Bi)n^/^ of the terms of S must be at least Bquc^^ log^ n. This means 
that at time Bquc^^ log^ n whp 0(n^/^) cards q will be in the same attached tree 
as they started in. At stationarity, the expected number of such cards is less than 
1. This proves that 

Tmix > Bone'^ log^ n 

as desired. 

Upper bound. Consider first the random multigraph, H, constructed in (1). By 
a result of O, on any bounded-degree graph no -vertex expander, one can choose 
paths between each pair I < i < j < uq, of vertices, such that no edge is 
used by more than Bn^ log no of these paths, where B is some constant. Now we 
want to apply this to H . It is well-known that H is whp an expander. However, 
it is not bounded-degree. On the other hand, if we replace each vertex v of H, 
with dy greater than, say, D, with a [rf^/ D'] -vertex expander and divide the edges 
to V equally and uniformly at random among these neighbors (modulo integer 
considerations), we still whp have an expander with 0{n) vertices. In the sequel, 
we assume that H has these low congestion paths and condition on that event. 

Now extend H to K and then to F according to (2) and (3) above. Let each 
edge of H be associated with one of its end vertices and let each vertex vi of F be 
associated with the vertex of H, that the edge in H from which vi, or the attached 
tree containing vi, arose, is associated with. Let X^, v E V{H), be the number of 
vertices of F that are in this way associated with v. Construct paths between each 
pair (ui, vi) of vertices of F by concatenating the path chosen above between u 
and V, with the shortest paths between ui and u and vi and v, where u and v 
are the vertices of H that ui and vi are associated with, respectively. Let L^, be 
the number of such paths that contain e, e G E{K). Fix an e e E{K) and let 
Pi, ... , Pa, a < BMlogM, be an enumeration of the chosen paths of H that 
contain e. For convenience, define arbitrarily for each Pi, one of Pj's end vertices 
as the start vertex and the other as the end vertex. Let Ui and Vi be Pi's start and 
end vertex respectively. Then 

A 

Le < XujXy.. (2) 
1=1 
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Each is a sum of at most C independent random variables, Yj{v), where each 
Yj(v) is the sum of an independent geometric(5) number of independent random 
variables having the distribution of T, the total size of a Poisson(l — 5) Galton- 
Watson tree. Here j ranges over E{v), the set of edges of H associated with v. 
Hence 

A 

By Cauchy-Schwarz, 

«=1 j&E{ui) k£E{vi) i=l jeE{ui) keE{vi) 

In each of the two sums on the right hand side of ([3]), each Yj(v) may appear with 
multiplicity, arising from that the corresponding Xy may appear several times 
in (|2l). It follows that the multiplicity is bounded by 2\E(H)\ which in turn is 
bounded whp by 8M. Fix one of the right hand side sums arbitrarily. Let Si, 
1 = 1,..., 8M be the set of l^ (v)'s that appears in the fixed sum with multiplicity 
at least I. Then the Si's are decreasing and our fixed sum can be written in a more 
neat form as 

EE^? w 

1=1 ieSi 

where Zi, . . . , Zsm are iid with the distribution of Yj{v). To bound the size of the 
Z/s we need to control the total size T of an attached tree and sums of indepen- 
dent T's. The following is standard and can be found e.g. in ||20l . 

By the usual encoding of a Galton-Watson process into a random walk, it 
follows that T has the distribution of the time of the first visit to of a simple 
random walk, starting from 1 and with step distribution Poisson(l — 5) — l. Hence 
the sum Z = Yl^=i '^r, where the T^ 's are independent and distributed as T, has 
the distribution of the first visit to 0, starting from W. The Z/s in (Hj) have this 
form, with W being geometric(5). After steps of such a random walk, the 
position of the walker has expectation W — j5^^ and standard deviation less than 
y/j5^^. By the Central Limit Theorem, the probability that the walker at this time 
is above the x-axis, is bounded by 6"^^""^^/^^+/^. Hence 
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Taking expectations gives 

The probability generating function of is 5/(1 — s(l — S)). Plugging in s = 

6-5/2 ^ (i + o(l))(l + (5/2), gives 

> j6-^) < (1 + o(l))2e-^/^ < e^'^/\ (5) 

Next we turn to the sum XlieS; ^i- ^ large integer sq. For 15*;! < sq, 
we have by Q that the probability that Zj > So5~^ for any j E Si is bounded by 
Soc^"*"/"^. Hence, for such I, we have X]je5, — ^o^~^ '^^^^ probability at least 

1 - soe^-^o/^ 

Consider now s := l^^l > sq + 1. Order the Z/s in Sf. > . . . > Z(s)- We 
have 

P(^(r) > < P(Bin(s, e^"^'/^) > r) < ^^-^ < 

whenever j > 4((l + 3/r) logs — log r + 2). Hence < 45~2(^(^x _|_3y'j") logs- 
log r + 2) for all r = 1, . . . , s with probability at least 1 — s^^. Hence, with 
probability at least 1 — s"^^ by standard integral approximation, 

< 16r^E((l + -)log^-l°S^ + 2)^< (160s + O(log^s))5-^ 

je5z r=l 

< i6isr^ 

provided that sq is large enough. The probability that this holds for all s = sq + 
1, . . . , 8M is hence at least 1 — Xl^so+i "^"^ ^ ^ ~ 7^- Summing up in ([5]), using 
(HI) and noting that there are whp 0(M log M) terms in each of the sums on the 
right hand side of ©, we get 

8M 

Le < Yl ^0^"'+ E lQMSi\S-'<[st + l6lY,\Si\)s-' 

l:\Si\<so l:\Si\>so 1=1 

= 0(r^M log M) = e{e-^n\og{e^n)) 

with probability at least 1 — 2(soe~*° + I/sq) , i.e. with probability arbitrarily close 
to 1 on choosing sq sufficiently large. This bounds the number of paths connecting 
all pairs of vertices of F that an edge of K can appear in. It remains to check 
in how many paths an edge of an attached tree can appear. However, given an 
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attached tree, the edge that must be used most frequently is one which is incident 
to the vertex of K. This edge is used in less than tn paths, where t is the size 
of the attached tree. By the above, the largest tree has whp size 9(e~^ log(e^n)), 
so no edge of an attached tree is used by more than ©(e^^n log(e^n)) times. By 
the above, we also know that whp the longest path in F is of order e^^ log(e^n). 
Altogether, whp 

A, = e(e-3log2(e2^)). 
Plugging this into the comparison lemma gives whp 

i"mix <r = 0(ne"^log^(e^n) logn) 

as desired. □ 

Proof of Theorem |1.4r e). This part is essentially a simplification or the proof of 
part (d). It is easy to show that the giant component G = {V, E) of G{N, (1 + 
e) /N) whp contains f2(n^/^) sticks (i.e. induced paths with one end vertex having 
degree 1) of length at least a log n if the constant a is taken small enough. See 
e.g. [17]. Letting these sticks play the role of the attached trees T^, in the lower 
bound part in the proof of (d), establishes that 

Tmix = Vt{n\o^n). 

Now turn to the upper bound. The construction (l)-(3) above does not in this case 
exactly generate a graph contiguous to G. There is however, a characterization of 
G, due to Benjamini et al. [4] that works equally well for our purposes. According 
to Theorem 4.2 of [4], there is an a G (0, 1) such that whp G contains a subgraph 
B such that 

(i) B is an a-expander (i.e. for any W C V with \ W\ < n/2, the number of 
edges connecting to V is at least 

(ii) Letting Di denote the connected components of G\B and E'{Di) the set of 
edges with at least one endpoint in Di, the number of i for which \E'(Di) \ > 
k is bounded above by n(l + e)e~"'^, k = 1,2, . . .. 

(iii) Each v E B is connected to no more than 1/a different Di's 

Using (i), we have as above that all vertex pairs of B can be joined by paths in 
B in such a way that no edge of B is used in more than 0{n logn) paths. Then, 
by (ii) and (iii), extending this to joining all pairs of G with paths such that no 
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edge of G is used more than 0{n \ogn) times, goes through in a much simplified 
form, without most of the calculations involved. Plugging this into the comparison 
technique is done exactly as before and yields 

^mix = 0(n log^ n). 

□ 

Proof of Theorem ll.4r f). Here an appeal to Border et al. O, establishing that all 
pairs of vertices can be joined by paths so that no edge is used more than 0(log n) 
times, goes directly into the comparison technique, giving Tmix = 0{n log^ n). 

The lower bound follows exactly as the standard argument for the random 
transpositions shuffle: after time time (1/2— a)n logn whp at least n"/^ cards have 
never bee moved and are hence in their starting positions. However at stationarity, 
the expected number of such cards is 1, so the stationary probability for this event 
iso(l). □ 

Proof of Theorem ll.4f g). The lower bound was proved by Wilson ll30l . Indeed, 
with ^{v) = ^{vi, . . . ,Vd) = 2f 1 — 1, ^ becomes an eigenvector to an eigenvalue 
of order l/{n log n) for the walk of a single card. Thus Proposition ! 1 .2| establishes 
the lower bound. 

For the upper bound, we again use the comparison technique, comparing with 
the random transpositions shuffle. Since there are 6(?2^) pairs of vertices, the di- 
ameter of is d and the number of edges in Z2 is Q{nd), any automorphism 
invariant rule for assigning representations of arbitrary transpositions via short- 
est paths will have that any edge appears in Q{n) of these paths. Now use the 
comparison technique as before. □ 
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